arXiv:1504.05690v2 [math.ST] 19Jun2015 


Noname manuscript No. 

(will be inserted by the editor) 


Empirical likelihood test for high-dimensional two-sample model 


Gabriela Ciuperca • Zahraa Salloum 


the date of receipt and acceptance should be inserted later 


Abstract A non parametric method based on the empirical likelihood is proposed for detecting the change 
in the coefficients of high-dimensional linear model where the number of model variables may increase as 
the sample size increases. This amounts to testing the null hypothesis of no change against the alternative 
of one change in the regression coefficients. Based on the theoretical asymptotic behaviour of the empirical 
likelihood ratio statistic, we propose, for a fixed design, a simpler test statistic, easier to use in practice. 
The asymptotic normality of the proposed test statistic under the null hypothesis is proved, a result which 
is different from the law for a model with a fixed variable number. Under alternative hypothesis, the test 
statistic diverges. We can then find the asymptotic confidence region for the difference of parameters of the 
two phases. Some Monte-Carlo simulations study the behaviour of the proposed test statistic. 

Keywords Two-sample ■ high-dimension • linear model • empirical likelihood test. 


1 Introduction 


The technology development and fast numerical techniques make possible to consider and study statistical 
models with a large number of variables. High-dimensional model refers to a model whose the number p of 
explanatory variables increases to infinity as the number n of observations converges to infinity. When p 
diverges, traditional statistical methods may not work with this kind of growth dimensionality. 

Most of the literature works on high-dimensional model utilize the LASSO (Least Absolute Shrinkage and 
Selection Operator) type metho ds, in order to au tomatically select the significant variables. The principle 
of these methods, introduced bv lTibshiranil ( 1996ll . is to optimize a penalized process, more precisely, a pro¬ 
cess with a Li-type penalty. If the model contains outliers, the parameter estimators by the least squares 
method with LA SSO penalty have a large error. An alternative method is then the penalized quantile 
method. Thereby. IPicker et al.l (201^ consider a quantile model with seamless-Lo penalty when the num- 
ber p of exp l anato ry variables is such that p —>• cxa, p/n —»• 0 as n —» cx 3 . For a general quantile regression, 
IWu and Liul ( 2009ll propose the SCAD penalty, while, in IZou and YuanI ( 2008|l . a composite quantile regres¬ 
sion is considered with an adaptive LASSO penalty. The case p —)• oo is also considered in iFan and Peng I 
fl20n4h for a non-concave penalized likelihood method, when p^/n —exp. C oncerning the group selection 
methods for high-dimensional models, the readers hnd in iHuang et al.l (|2012ll a review of methods. 

Ah these methods are based hrst on the principle of selecting (automatically) the signiheant variables. 
Then, the dependent variable is modeled only as a function of the signiheant variables, in order to have 
more accurate parameter estimators and a better adjustment for the dependent variable. 
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If the goal is to have the most accurate prediction and also robust, in the case of a model with outliers, one 
possibility is to consider the empirical likelihood (EL) method. But, for this type of method, in literature, 
most papers are devo ted to the c ase of fixed p. For a high-dimensional linear regression model, we can refer 
first to the paper of [C mo et all ( 201.'l|l . when the design is deterministic. High-dimensional data are also 
studied bv iLiu et al.l (|2013|l . where EL method is considered for a sequence of i.i.d. random vectors with 
dimension p, when p —>■ oo as n ^ oo. 

In this paper, we are interested by a change-point model, that is, a model which changes at some mo¬ 
ment. The number p of explanatory variables varies with the number n of observations and p can converge 
to infinity if n —>■ oo. 

Since statistical techniques in high-dimension are fairly recent, th ere are not many papers in literature 
that address the change-point problem in a high-dimensional model. iLung-Yut-Foner et al.l ( 201.'lh propose 
an approach for detection of a c hange-point in high-v o lume network traffic. The asymptotic distribution 
of the test statistic proposed in iLung-Yut-Fong et al.l ( 2013|l . under the null hypothesis that there is no 
change-p oint, is th e arqs up of a Brownian Bridge. There are some papers where LASSO type methods 
are used. [Lee et al.l ( 201f)ll consider a possible change-point in a high-dimensional regression with Gaussian 
errors. The main result of the article is to show that the sparsity property is maintained, even if there is 
a change in t he mo del. There is no hypothesis test to decide the presence or absence of change in model . 
In ICiupercal ( 2ni4|l . LASSO-type and adaptive LASSO estimators are studied, while in ICiunercal ( 201, *111 
quantile model with SCAD penalty is considered. These last two papers consider model s with p f i xed. I n 
order to choose the change-point number, a model selection criterion is also proposed bv ICiunercal ( 2014ll . 

To the authors’ knowledge, the EL technique has not yet been addressed in a high-dimensional two-sample 
model, that makes the interest of this work. We study the asymptotic behaviour of the empirical likelihood 
ratio test statistic when the design is deterministic. 


We consider a first linear model: 


Fi = X‘/3-h Ei, j = l,---,n. (1.1) 

Consider now a second linear model which changes at observation k. It is called two-phase model, or model 
with one change-point: 

Y — J + Si, 1 < i < k, , . 

* \xj/32+e^, k<i<n, 

where Xi is a p x 1 vector of p explanatory variables, f3 and /32 are p x 1 vectors of unknown parameters and 
Si designates the model error. The parameter f3 of the first phase of (11.21) coincides with that of (11.11) . For 
models dni and don, Yi is observation i of the response variable. The errors Si are supposed independent 
identically distributed (i.i.d), with mean zero and finite variance cr^. 


We assume that the number p of explanatory variables Xi depends on the sample size n: p = pn, such 
that Pn —>• oo as n —>• oo. The change-point k of (Ea) also depends on n. The change in model dra) takes 
place far enough from the first observation and sufficiently previous to the last observation. So, we suppose 
that limn-^oo k/n G (0,1). 

In this paper, for given k, we use the empirical likelihood method to construct the confidence region for 
f3 — (32, or equivalent to test the null hypothesis of no change in model dEa. Under null hypothesis, the 
model has the form dni), that is 

Ho: (32= /3. (1-3) 

The alternative hypothesis assumes that one change occurs in the regression parameters, that is 

Hi: (32^ (3. (1.4) 

The paper is organized as follows. In Section[awe first present the EL method for the two-sample model. 
Some notations used throughout the paper are defined and needed assumptions for the theoretical study 
are also announced. In Section |a we construct an empirical likelihood ratio test statistic and we study its 
asymptotic behaviour. The asymptotic distribution under Ho of the test statistic is obtained, while, under 
Hi, this statistic diverges. Next, in Section^ we analyse the coverage accuracy and the empirical power by 
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means of simulations, which confirm the performance of proposed test. A new critical value is also proposed 
in order to improve the coverage rate. The proofs of the main results are given in Appendix (Section [5|) 
followed by some Lemmas and their proofs. 


2 Preliminares 


In this section, we introduce the EL method for the two-sample model. Notations and assumptions are 
also given. 

Under null hypothesis Hq, that is model (ini), let /3° denote the true value of the parameter (3. Under 
alternative hypothesis Hi, that is model don, the true values of /3, (32, respectively, are /3°, /S®. 

In order to define the profile empirical likelihood (under Ho and under Hi), we introduce the following 
random p-vector, for all f3 G IH’’ and i = 1, ■ ■ ■ ,n\ 

z.(/3) = X.(y. -x‘/3). 

Consider also the vector 

z° = X,ei. 

We remark that, under the hypothesis Hq, we have z° = Zi(/3°), for all i = 1, ■ ■ ■ ,n and IE[z°] = Op. On 
the other hand, for fixed design (Xi)i<i<„, the random variables z° are independent but not identically 
distributed. 

On the change-point, we consider the notation 9nk = k/n. Thus, in view of the remark made in Introduction, 
we assume that t 0° G (0,1) as n —^ oo. 


For the dependent variable Yi of model (11.211 . let us consider the probability to observe the value yi 
(respectively yj) : qi = lP\Yi = yi], for i = 1,... ,k and qj = lP\Yj = yj], for j = fc-)-l, ■ ■ ■ , n. Obviously, these 
probabilities satisfy the relations J2i=i = 1 ^^nd J2^=k+i © ~ Corresponding to these probabilities, we 
define the probability vectors (gi, ■ ■ ■ , qk) and (qk+i, ■ ■ ■ , q-n)- 
Under hypothesis Hq given by don, the profile empirical likelihood for (3 is 


k n k n k n 

TZnk{f3)= sup sup n = YI = Y = Op}, 

(Pl,--- ,gfc) (Pfe + l,--- ,9n) j = k+l i=l j = k+l i=l j=k+l 

with Op the p-vector with all components zero. 

Similarly, under hypothesis Hi given by (Ha. the profile empirical likelihood is 


k n k n k n 

'Tink{(3,l32) = sup sup n = Y = Op, Y = Op). 

(gi,---,qk) (qk+i,---,qn) ^=1 j=k+i i=i j=k+i i=i j=k+i 

Then, using an idea similar to the maximum likelihood test for testing Hq against Hi, we consider the 
profile empirical likelihood ratio TZnk{l3)/'Hnk{P, (32)- 

Since TZnk{(}, (32) — k~^{n — we have that the corresponding empirical log-likelihood ratio is 


—2 sup sup 

(9l,-" ,9fc) (9fc+l,--- ,9n) 


k n k 

Y log((n-fc)gj);^gj = 

2=1 j = k-\-l 2=1 


n 


5 ] ® 

j = k+l 


k n 

l,^g^z^(/3)= Y ®Zj(/3) = 0p}. 

2=1 j = k-\-l 


Applying the Lagrange multiplier method, the optimal probabilities qi and qj are 

fc-I-nAiZi(/3) ’ ® n — k — n\2Zj{(3)’ ^ ^ 

where Ai,A 2 G IR^ are the Lagrange multipliers. Consequently, the corresponding empirical log-likelihood 
function can be written as 


k n 


(2.2) 
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Taking into account relation (12.11) . the derivat i ve wit h respect to f3 of (12.21) is J2i=i 

Op. We can apply Lemma 4 of IGuo et all (l2013ll on each phase of model, that implies that ||Ai|| = 
and ||A 2 || = Oip{p^^^{n — Then, the probabilities qi and qj of (12.11) are approxi- 

matively k~^ and (n — k)~^, respectively. Thus, we can restrict Ai and A 2 such that 

k n 

fc“^^X,X-Ai = (n-fc)“^ XjX*A 2 . (2.3) 

i=l j = k-\-l 

If the symmetric matrices A;“^ X^X* and (n — X]J=fc+i XjX* converge, as n —>• 00 , to two 

strictly positive definite matrices, then the relation (12.31) can be written Ai = XiX)) ^((n — 

fc)“^E;=fc+iXjX‘)A 2 . Noting by A 2 = X^X*)((n - /c)-i we have the 

new Lagrange multipliers such that Ai = A 2 . For the sake of readability, we denote A 2 by A 2 . 


With this remark, we will restrict the study to a particular case, when Ai = A 2 = A. Considering this 
constraint, instead of statistic ( 1 ^ we consider the following particular empirical likelihood ratio (ELR) 
statistic 

k n 

EL„fc(/3) = 2^1og(l + Ja*z,(/3))+2 ^ log (l --^a‘z,(/3)), (2.4) 

i=l j=k-\-l 

where the Lagrange multiplier A G IR^ satisfies 


E 


Zz{(3) 

k/n + A*Zi (/3) 


E 


j = k+l 


\ — kjn — A*Zj(/3) 


= Op. 


(2.5) 


2.1 Notations 

We provide a brief summary of notations used in the paper. 

For exposition convenience, we define some general notation. All vectors are column and v* denotes 
the transposed of v. All vectors and matrices are in bold. For a vector v, by ||v|| we denote its Eu¬ 
clidean norm and by ||v||i its Li-norm. For a symmetric p-square matrix A = (a^j), let us denote by 
71 (A) > 72 (A) > .. . > 7p(A) the eigenvalues and tr(A) as the trace operator of the matrix A. Consider 
also the following notation Mx(A) = maxi<i^j<p \aij\. We denote by ||A||i = maxj=i^... ,p(Ef=i 
subordinate norm to the vector norm ||.||i. 

All throughout the paper, C denotes a generic constant which may be different from line to line and even 
from formula to formula and whose value is not of interest. 

Moreover, Op denote the p—vector with all components zero. 

At the beginning of this section, the notation O^k = k/n was introduced. To simplify notations, we will 
use the notation 0 instead 0„fc- 

For /S®, the true value of the parameter P on the phase 1, • • • , fc, and the test value under Hq, we define 
the following p-square matrix 




n(l — Oy 




j = k+l 


( 2 . 6 ) 


and the following p-vector 







1 


E 

j = k+l 


n(l — 6) 


(2.7) 
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Under null hypothesis, for the true value /3° of /3, the mean of the random matrix Sn(/3^) is the following 
p-square matrix 


where, for i = 1 ,..., n 


2=1 ^ ^ j=k+l 

v[’i) = Var(z°) = a^XiXj. 


( 2 . 8 ) 

(2.9) 


For i = 1, • • ■ , n, let us also consider the following random vector 

0 \ — 1/2 / /d0\ 

Wi = (V„) ' z,(/3‘'). 

Corresponding to the components of w° = (tc® x: ■ ■ ■: ty°p), we consider for i = l,...,n, for r G N*, 
, ■ ■ ■ ,tr e {1, • • • ,p}, the following scalar 

^*'*' *" = i)r ^ (2-10) 


nO' 

2 = 1 

and the following random variable 

k 


j = k+l 


ti C2 ■ ■ _ 


nO 


1 00 0 I 


n (6 — !)’■ 


E O 0 0 

j = k+l 


( 2 . 11 ) 


where is the r-th component of . In particular, for all ti,t 2 € {1, • • ■ ,p}, = 0, is 

the Kronecker delta, that is = 1 if ti = t 2 , and 0 otherwise. 


2.2 Assumptions 


We now state the assumptions on the design, on the errors, on the number p of the explanatory variables 
and on the change-point location. These assumptions are needed in order to keep the properties obtained 
for EL statistic in a high-dimensional model, without change-point. 

For assumptions (A3)-(A6) the constant q is such that g > 4. 


(Al) There exist positive constants Co, Ci > 0, such that 0 < Co < infn 71 (V°) < sup„ 7 i(V°) < Ci < cxo. 
(A2) lE{ef) < C 2 for some C 2 > 0 and for all n. 

(A3) p~^ Ss=i < C' 3,1 < i < n, for some C 3 > 0, and g > 4; 

(A4) Ujeip'^ < Ci, for some C 4 > 0. 

(A5) p fc( 2 - 9 )/( 29 ) ^ 0 and p (n - ^ 0 , as n ^ 00 . 

(A6) p 2 + 4 /g ^ ^ ^ 0 and {n — k) ^ 0 , as n —^ 00 . 

(AT) = 

(A8) EE.u=i = 0(p5/2) and EE.-=i 

(A9) For alH = 1, ■ ■ ■ , n, for I £ N* , ji, ■■■, ji £ {1, ■ ■ ■ ,p} , and whenever Ei=i ^ 6 , there exists a 
positive absolute constant C 5 < 00 , then ■ ■ ■Wi‘ji) < C 5 . 


Assumptions (A3) and (A 6 ) guarantee that the eigenvalues of S° are close to those of V° (see Lemma 
[ 2 ]). Assumption (Al) implies that V° is uniformly nonsingular and bounded, for large n. Then, for n large 
enough, with probability close to one, S° i s non si ngular and 0 < Co < 7 p(S° ) < 71(8° ) < Ci < 00. Assump¬ 
tion (A3) is also assumed by IGuo et eJ (|2013f) . iHiort et al.l (l2009l) . iLiu et all (120131) for high-dimensional 
model without change-point. Assumption (A4) together with (A3) and (A 6 ) imply supx<x<„ |A*Zx | = Op(l), 
which le ads to Tay l or exp ansions of (lOl) and (E^l) (see LemmaO. Assumptions (Al), (A2), (A4) are also 
used by IGuo et all ( 20131) for linear models without change-point with random design. Same assumption 
(Al) is requ ested in Zi et al.l ( 20121) for a two- sample model with fixed p. Assumptions (A5)-(A9) are also 
assumed bv iGno et all ( 2013 ). Liu et all ( 20131) . in order to have for the asymptotic normality of the ELR 
statistic. 
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3 Main Results 


In this section, we present the main results of this paper. The asymptotic distribution of ELR test statistic 
under hypothesis Hq will allow to build the asymptotic confidence region for the difference of the parameters 
of the two phases of model. We c an also test if the models changes after observation k. In comparison to the 
obtain ed results for fixed p (see iLiu et al.l 1 2008^ . IZi et al.l ( 2012ll for linear model. ICiuperca and SalloumI 
( 2015h for nonlinear model) where the asymptotic law is the distribution with p degrees of freedom, in 
the case presented here, the test statistic is different and it has a standard normal asymptotic distribution. 
In order to find this asymptotic distribution, we first need some intermediate results for studying the asymp¬ 
totic behaviour of the ELR statistic. 

We emphasize that the presence of the break point k complicates the study and leads to a different approach 
in respect to a model without change-point. 


Note that under the hypothesis Hq, we have: Zi(/3°) = z° = Xi£i, while under Hi, the vector Zi(/3°), 
for i = k + 1, ■ ■ ■ ,n becomes 

z,(/3‘>)=X,X*(/3°-/3‘>)-z°. (3.1) 

When Hq is true, we denote by S° the matrix Sn(/3°): 




and by 'i/’° the vector 


E 


n(l-6»)2 z: 

^ ' j=k+l 




(3.2) 




n(l-6l) 

^ ^ j = k+l 


(3.3) 


The Lagrange multiplier A is a key element in any empirical likelihood formulat i on. T he first result 
concerns the convergence rate to zero of A defined in (12.511 . When p is fixed, IZi et al.l (|2ni2ll showed that 
||A|| = 0]p{n~^^^). When p is growing along with n, the above rate for ||A|| is no longer valid as shown by 
the following proposition. In the proof we use Lemma [2l Lemma [3] and Lemma 

Proposition 1 Suppose that assumptions (Al), (A3)-(A6) are satisfied. Then, under hypothesis Hq, the 
Lagrange multiplier X satisfies ||A|| = Oip{p^^‘^n~^^‘^) . 

jp 

Accordingly to this Proposition, by assumption (A6), we have that ||A|| —j-0, as n ^ oo. More precisely, 
the Lagrange multiplier A has the following approximate form given by Proposition [2] The proof, given in 
Appendix, is obtained by combining Lemma |4l Lemma [6] and Lemma [T] The p-square matrix V® is defined 
by (12.811 and the p-vector 1 /^° by (13^ . 

Proposition 2 If assumptions (Al), (A3)-(A6) are satisfied, then, under the null hypothesis Hq, we have 

A = (V0 )-V°(l + Oip(l)). 

We prove now the following two propositions, all satisfied under hypothesis Hq. They give two ap¬ 
proximations for the ELR statistic EL„fc(/3°), defined by (12.411 . approximations which will allow to find its 
asymptotic distribution. 

In the proof of the following Proposition are used Lemma [5l Lemma [6l Proposition [2] and Lemma [3 

Proposition 3 Suppose that assumptions (A1)-(A8) are satisfied. Then, under the null hypothesis Hq, we 
have 

EL„fe(/3‘>) = ni/,°‘(S°)- + ojp(p'/"). 

For the proof of Proposition^ given in Appendix, we use LemmaHl Lemma[Sl Proposition[3and Proposition 

El 

Proposition 4 Suppose that assumptions (Al), (A3), (Af), (A6) and (A7) are fulfilled. If the hypothesis 
Hq is true, then we have 

TLr.{(3°) = nV^°‘(V°)-+ ojpip^/^). 
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The following theorem establishes the asymptotic normality of the ELR test statistic, when dimension p 
of the explanatory variables increases to infinity as n —>• oo. Its proof, given in Appendix, is very technical and 
moreover the change-point presence in the model occurs in an essential way. Proposition [3] and Proposition 
|T]are used in the proof. We note that the variance of standardization depends localisation of the change 
in the interval [1 : n]. 


Theorem 1 Under null hypothesis Ho, if assumptions (A1)-(A9) are satisfied and p = oirfi^^), then 

(^) 

-P £ 


Anjn 




r**; P AAr(o,i), 

ZAn/Tl n^oo 

where AI = af = lE[si] - and: 

- for i = 2, - ■ ■ ,k + l, 


= ^EM(Vn)“'V^.)(V°)-Xo) + 


04 


(3.4) 


for i = k + 2, 


2 

a, = 




6 *2(1-6>) 


1=1 

tf\rO\-lv \2 jrpr^A 


l=k-\-l 


(x((v^)-4xo^ie[4] - 


(1-0)4 

The following result is an immediate corollary of Theorem [1] 

Corollary 1 Testing the null hypothesis Hq : /3 = (32 = /3° against the alternative hypothesis Hi : (5 = 
[32 (3°, is equivalent to constructing the confidence regions for 'y = (3^^ — 0^, or to testing the null 

hypothesis Hq : 7 = Op. Then, based to Theorem^ in order to test Hq against Hi, we consider the following 
asymptotic test statistic 

,Ox _ 'n'tpi{f3°){Vl)~^tPJ[3°) - p 


zm = 


Anjn 


(3.5) 


Note that xp.^{f3°) through Zi{f3°), given by relation (13.111 . for i = k + l, - ■ ■ ,n, depends of 7 = /S'* — /S®- 

The asymptotic behaviour under hypothesis Hi of the test statistic Z{I3°) is given by the following 
theorem. We show that Z{f3°) diverges under alternative hypothesis. 

Theorem 2 Under alternative hypothesis Hi, if assumptions (A1)-(A9) are satisfied and p = o{n^^^), then 

\zm ^ cx). 

n .—>00 

Theorem[T]and Theorem [2] allow to build the asymptotic confidence region for the parameter 7 = /3°—/Sj. 


Corollary 2 The a-level asymptotic confidence region for 7 is 

3^i-a/2 = {1 ■■\Z{^°)\<Ci_^/2] . (3.6) 

where Ci- 0,/2 is the quantile of the standard normal distribution. 

For simulations, in order to calculate 'R.i-a/ 2 , the matrix V)) is firstly calculated by relation (ESI). Once 
the model has been generated, we calculate z° and then Zi{f3°) by relation (13.11) . Finally, we calculate 
by (12.71) . An by Theorem [1] and the test statistic Z{f3°) by (13.511 . For M Monte Carlo replications 
of the model, the coverage rate (CR), is the number of times when \Z{(3°)\ is less than Ci_q,/ 2 , divided by 
M. 
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For applications on real data, we will test model (inD against model HOD. For these models, we know n 
values for the response variable Y and for the p — 1 explanatory variables X 2 , ■ ■ ■ , Xp. The point fc, where 
we want to test if there is a change, is known, while the values of /3° on the hrst phase can be unknown. 

If /3° is unknown, then it is estimated by a convergent estimator on the observations i = I,-- - ,k, for 
example by LS method or quantile method, depending on the distribution of Y. Once we dispose of an 
estimator f3j. for (3°, the variance of e is estimated afterwards by a convergent estimator, for instance 

= (^ “ P)~^ ~ We calculate thereby = dfcXiX*, for any i = I, ■ ■ ■ ,n and then 

V° by dSS. For any i = 1, • • • , n we calculate Zi0f.) = Xi{Yi — XlfBf.), which will allow us to calculate 

the vector of relation (12.711 . With all of these elements in place, we can calculate the value of the 

statistic Z{0i^) = (n'i/j^(,3j,)(V°) — p) ( 2 \„/n)“^, using for An the relation given in Theorem[T] 

For a given size a G (0,1), if the value of \Z{f3f,)\ is less than Ci_q/ 2 , then hypothesis Hq is accepted, that 
is to say that the model does not change after observation fc, otherwise hypothesis H\ is accepted. 

If /3° is known, we can consider as an estimator for o^'. Sri. = {k—p)~^ * = !,■■■ 

we calculate Zi(/3°) = XiiYi — X‘/3°) and afterwards by relation (12.711 . Finally, the absolute value 

of Z{f3°) will be compared with Ci-a/ 2 - 

Remark 1 Compared to \Liu et ai\ \S00fi) . where, for fixed p, a test statistic is proposed for testing the 
presence of the change-point, by maximizing ELR in respect to (3 and \, in the present work we fix the 
param eter on the first phase and we test whether the parameter of the second phase is the same. In \Liu et al\ 
\ 200 A) . the system of equations in X and f3 of the score functions must be solved, which can be numerically 
quite tedious. In this paper, apart from the fact that we consider p —>■ 00, using theoretical properties for the 
Lagrange multiplier X, we propose a simpler form for ELR statistic, easier to use in practice. Parameter 
(3°, if it is unknown, can be estimated on the observations 1, ■ ■ ■ ,k by a simpler computational method, in 
order to obtain (3^ a convergent estimator, i.e. \\f3i. — (3^\\ = ojp(l). 


4 Simulation study 

We now conduct simulation studies to evaluate, in terms of coverage accuracy and empirical power, the 
test statistic specihed by Theorem [T]^ii), with EL„fc(/3°) approximated by Proposition 01 ie Z{(3'^) given by 
relation ( 1 ^ . For these studies, we use Monte Carlo simulations. Throughout, we consider the size a = 0.05. 

The p explanatory variables are generated as follows: X\ = 1 and {X2, ■ ■ ■ ,Xp) ~ A/'p-ifOp-i , X), with 
the co variance matrix S = (c/ij), gni = 2“!^“*^, 1 < h,l < p — 1, the same matrix considered bv iGuo et all 
( 201,*lh . for a model without change-point. In order to be in a hxed design, we consider the same realization 
for (Xi)i<i<„ for each Monte Carlo replication. 

Concerning the coefhcients (3° of the model, under Hq, we take /3° = (/3°, '' i /3p) = {3,2, - ■ ■ ,p). 

For model ( 021 ), under hypothesis Ho '. (3 = 02 = 0^, first calculate the coverage rate (CR) based on 
Corollary 121 for a given change-point k. 

We consider different values for n and k and two different distributions for the errors (si): standard normal 
distribution A/'(0,1) and e ~ Sxp{l) — 1, where £xp{l) is the exponential distribution with mean 1. 


4.1 Importance of assumptions (A5), (A 6 ) 


In this subsection we realise throughout 2000 Monte Carlo replications for studying the behaviour of the 
test statistic behaviour, under null hypothesis and afterwards, when model has a change-point. Coverage 
rate and empirical power are investigated. Values of n and k are n G {20,100,200,400,600,1000}, k G 
{5,25,75,280,350}. 

Analyse of coverage rate. The results are summarized in Table [H where we give CR = 1 - empirical size, 
based on Corollary [2l relation (13.511 . For n and k hxed, the CRs decrease when p increases, this decreasing 
trend being more pronounced in the exponential error case. We observe that whether for exponential errors 
or for gaussian errors, if assumptions (A5), (A 6 ) are not sat i shed, then the CRs are well below 0.95. These 
results are in accordance with those obtained bv ICuo et all ( 2013|l . for models without change-point, with 
hxed design. 

In order to conhrm this supposition, in Table [21 the values of n and k are varied such that 9 = kjn = lf2 
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Table 1 Coverage rates (CR) for Exponential and Gaussian errors. 



Exponential errors 

Gaussian errors 

n k p 

CR 

CR 

20 5 2 

0.94 

0.96 

3 

0.92 

0.95 

5 

0.91 

0.92 

7 

0.77 

0.87 

100 25 2 

0.94 

0.95 

10 

0.88 

0.94 

20 

0.71 

0.84 

200 75 2 

0.93 

0.94 

10 

0.89 

0.93 

20 

0.77 

0.94 

50 

0.73 

0.85 

400 280 2 

0.92 

0.91 

10 

0.87 

0.88 

20 

0.79 

0.86 

50 

0.75 

0.84 

100 

0.60 

0.83 

600 350 2 

0.94 

0.92 

10 

0.92 

0.93 

20 

0.86 

0.87 

50 

0.85 

0.88 

100 

0.78 

0.84 

200 

0.66 

0.84 

300 

0.54 

0.70 

1000 350 2 

0.95 

0.94 

50 

0.86 

0.92 

100 

0.84 

0.89 

200 

0.78 

0.80 

300 

0.52 

0.77 


and p satisfies (A5), (A 6 ). We obtain then that the CRs are larger than 0.90. 

Analyse of power. Under Hi, we consider = 1 — l3°. In the all considered cases, for n, k, p and e in 
Tables [1] and [2] we obtain that the empirical powers are eqnal to 1. 


Table 2 Coverage rates (CR), by 2000 Monte Carlo replications, for Exponential and Gaussian errors, 6 = 1/2. 





Exponential errors 

Gaussian errors 

n 

k 

P 

CR 

CR 


20 

10 

2 

0.92 

0.96 

100 

50 

4 

0.93 

0.94 

200 

100 

5 

0.96 

0.97 

400 

200 

10 

0.90 

0.94 

600 

300 

20 

0.90 

0.92 

800 

400 

20 

0.93 

0.93 

800 

400 

30 

0.90 

0.91 

2000 

1000 

30 

0.91 

0.91 


4.2 CR’s improvement 

In order to obtain more precise false probabilities, for fixed size a, we will calculate, by 10000 Monte 
Carlo replications, the (1 — «/2) and a/2 quantiles, denoted ci, 62 , respectively, for test statistic Z{(3°). We 
consider the new critical value Ci-a ./2 = max(ci, |c 2 |). These new critical values, for p = 50, for different 
values of n and k, for J\f{0, 1) and £xp{l) — 1 distribution errors, are given in Tabled These values are not 
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influenced by value of k, for fixed n. This is observed by calculating C\-a /2 for k such that 9i = kjn = 3/8 
and afterwards we calculate the CRs, denoted CR^ for another k such that 02 = kjn = 5/8. We observe 
that the values of Ci-a.l 2 are larger than the quantile of the standard normal distribution and Ci-a.l 2 are 
larger for exponential errors than those for normal errors. On the other hand, the values of Ci_q ;/2 decrease 
when n (and k) increases and they approach to quantile of A/"(0,1). 

In the same Table, are given empirical powers, denoted tt, calculated for = \ — (3° under ffi, considering 
Ci-a /2 as critical value. We obtain that all are equal to 1. 

If under H\, only two components of (3° change: is such that /32j = f3j for all j £ {1, • • ■ ,p} \ {3, 30}, 

^ 2,3 = f^ 2 , 3 o = f ^30 + 1; always get it = 1. Here we have denoted by the jth component of /Sj. 

Hence, even if there is a small change in the coefficients, most coefficients remaining unchanged, the test 
statistic detects this change. 


Table 3 Empirical critical value Ci _^/2 and corresponding coverage rates (CR), empirical powers (vr), for Exponential and 
Gaussian errors, p = 50, 02 — ^ ~ 0^ • 




Exponential errors 

Gaussian errors 

n 

k 

^l — otl2 1 CR 'k 

Cl — oc 1 CR TT 


200 75 

125 

4.01 

0.97 1 

0.97 1 

3.24 

0.97 1 

0.93 1 

400 150 

250 

3.41 

0.97 1 

0.97 1 

2.96 

0.97 1 

0.98 1 

600 225 

375 

3.40 

0.97 1 

0.97 1 

2.85 

0.97 1 

0.98 1 

800 300 

500 

2.68 

0.95 1 

0.95 1 

2.43 

0.97 1 

0.95 1 

2000 750 

1250 

2.48 

0.97 1 

0.95 1 

2.30 

0.97 1 

0.93 1 


4.3 Conclusion of simulations 

Proposed test statistic (1531) . with A/'(0,1) the asymptotic distribution under Hq, involves the construction 
of a confidence region for the parameters of the second phase of the model (on observations fc + 1, • • • ,n). 
If assumptions (A5), (A6) are satisfied, then the coverage rates are close to the nominal coverage level. Con¬ 
trariwise, if the coefficients change on the second phase, the test always detects this change. For improving 
the coverage rate in the case n — k ^ oi k ^ , we proposed to calculate new critical values. With 

these critical values, the rate of false changes is generally smaller than the size a. If there are changes in the 
coefficients of the second phase of the model, the test statistic based on the new confidence region always 
detects this change. For fixed p, if n and k increase, such that A:/n=const ant, then these new critical values 
decrease and approach the (1 — a) quantile of A/"(0,1) distribution. 


5 Appendix 

This section is divided into two subsections. In the first we give the proofs of the Propositions and of the 
Theorems. In the second subsection, we present Lemmas ans their proofs. 

We recall that under the hypothesis Ho, the vector Zi(/3°) is z/ = Xi£i. Then, in the all proofs, if 
hypothesis Hq is true, we will use z/ instead of Zi(/3°). 
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5.1 Proposition and Theorem proofs 


Proof of Proposition [TJ Let ns write A as A = ||A||u, where u is a p-vector with norm one. Using Lemma 
[31 for 0 = k/n, we have, for any i = 1 ,..., fc, with probability one 

0< i + Mu‘z° < 1 + ||A||T°, (5.1) 

where T° = max i^i^k 1|, (1 — ||}. For j = k + 1,... ,n we have, with probability 

1 , 

Using relations (EH) and (El, then we get from ( 1 ^ that 


1 ^ to 

o = -y ^ 

n y 6 + A*z? 

Z =1 ^ 


1 

-- E 


u‘z° 


n . ^ 1 - 6 * - A*z° 

j=k+l 3 


= 


1 


—' 1-h 6 >-i||A||u*z° n{l-6) _^ 


E t 0 

u z, - 


1 


t(l- 0)2 


ll^ll E TVTT^ 


u*z°z°*u 


i=l i = l M M Z V ^ j — 

By the last equality, using also notations given by (El and (El, it follows that 

IIAI 


l-{l- 6 ») i||A||u‘zO' 


0 < uV° - 


t c? 0 

u Sr,u. 


1 + l|A||TO 

Then, we have with probability one, that u*?/)))(1 -|- ||A||T°) > ||A||u*S)(u. Therefore 


lAII < 


u‘SOu-u‘V>OTO- 


(5.3) 


On the other hand, we have In*!/;® | < ||i/’° ||. Then, using LemmaE we obtain that Hi/j))|| = Oip(n“^/^p^/^), 
which gives 

\i (5.4) 

Using Lemmaldland relation (15.411 . we have that = Ojp(n~^/^p^^^)ojp(n^/®p^/^) = ojp(n^~®“*"^^^^®p). 

Then, by assumption (A5), we obtain that = ojp(l). 

On the other hand, according to Lemma E u*S° u > 7 p(S°) > Co > 0 holds with a probability tending to 
1 as n — 1 - oo. Then, for relation (15.31) . we obtain that 

||A|| = 0^(|uV°|/Co) = OjpiWyj) = Ojpyy-y. 


Proof of Proposition E By LemmaEwe have that A = (S°) ^ (R° + ^n) (1 + 01 P( 1 ))- 

In the other hand, by LemmaE we have that II = Ojp(p^^^n“^/^). Using this fact and relation (I5.64L 

we obtain 

||R-n|| / 1/n 1/2 -1/2 l/2^ / (2-n)/2nx 

= ojp(n ^ n ! p ! ) = op{pn'' "). 

IICII 

Therefore, by assumption (A5) we obtain that R° = i/j^Oip)!). Then A = (S° )“^'i/j° (1 -|- Oip(l)). 

In the other hand, by LemmaE we have that ((S°)“^ — (V^)"^)!/))) = (V° )“^’j/)°Op(l). Then, = 

(V° )“^'i/j° (I -|- ojp(l)). Therefore, for A = (S° )“^'i/j° (1 -|- ojp(l)), we obtain that 

A=(V°)-V°(l + o^,(l)). 


(5.5) 
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Proof of Proposition [3l By Lemma[5]we have that 


EL„fc{/3‘’) = 2nA‘'0° - nA*S° A + f 3 + ojp(1), 

with£ 3 s|(iy;(z?‘A)’-— t. 

i=l ' ' j=k+l 

Consider now, the following p-vector 


R° = 


nO^ 


E 


„ 0 /\£„ 0\2 


E 


n(l — 9)^ ^ 

^ ’ j=k+i 


Zj (A Zj ) . 


(5.6) 


(5.7) 


By Leninia[6]we have that A = (S°) ^(R° + 'i/’°)(l + ojp(l)). Then, we have for (15.61) that 

EL„fc(/3“) = n-0°‘(S°)-- nR°*(S°)-'R° + ^3 + ojp(1). (5.8) 

We now stndy £3 and n(R°)‘(S°)“^R° in parallel. By Proposition [2] we have that A = (V°) + 

ojp(l)), which implies 

z°*A = z°*(V°)-V°(l + ojp(l)). (5.9) 


Then, £3 becomes 


ft = JsE t 


+ 


i=l 

2 


nO ■ 


niO-l) ^ 

'■ ’ i=fe+i 


(1 + Ojp(l)) 


3(6*-1)3 


f: (zf(V°)-V^(V°)-V^(^^z° + -^^ X: -?))'(!+ oip(l))- 


(0-1) 


j = k+l ' 1=1 j = k + l 

Using notations given by (iTTOl) . (ITTTIi and the strong law of large nnmbers (Markov’s Theorem), we obtain 


ft = |E(”'?'(;sE’'? + ;;( 8 ^ E E + E ’'?)) 

i = l i=l ' ' i = k+l ' ' l = k+l i=l ' ' j=k+l 


2 n 

T 

2n 


p 1 ^ 1 " 

E r s u/ \ '' 0 0 0 I ^ \ '' 0 0 0 Wi I /i\\ 

^ ^ +:^ 7 oz:J )3 wj,,.wj>w,,„)(i+ ojp(i)) 

C ^l=^ 7 - 1 ^ ' j=k-\-l 


r,s,u=l 
P 


E r s u rsu /-i , /1 \ \ 

uj uj u! a (l + ojp(lj). 


r,s,it=l 


In the other hand, replacing A in relation (15.711 we obtain 


= fJ- 

" ln03 


E 


z°(z°V°‘(V°) 


- 1\2 


1(1 -0)3 


z°(z°V°‘(V°)-i)"Vl + ojp(l)). (5.11) 


j = k+l 


Consider now for n(R„) (S„) ^R„ the following decomposition 


■p0£ / qO { /'Q^ \ _ {\7'^ j-R^^ fX/'^ ^“^R^ 

\^n) \ ’ nj ) n) ^ri’ 


(5.12) 


By Lemma[7Kii), we have that ((S°) ^ — (V°) ^)R° = (V°) ^R°Ojp(l). Then, relation (15.1211 becomes 
R°‘(S°)-iR° = R°*(V°)-iR°(l + ojp(l)). Using relations (E^, dEZI), (l5lTll and the fact that = 


^(1 + Ojp(l)) 


(5.10) 
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(V°) for i = 1, ■ ■ ■ 


n, we have that nR°‘(S°) can be written 

k ^ k 


/■ k k 

Tj0t/c0\-iT30 _ J 1 1 1 „_o\t„_„on2 

t,R„ (S„) R„ — i 03 [[^q 2^ Wi 

*' 1 = 1 i=l 


^( 1 - 0 ) 


w°)‘w?; 


j=k+i 


\t 0l2 

W; 


-(T^ S "J)' 

^ ’ l = k-\-l i=l ^ ’ j = k-\-l 

^ ^=1 2=1 ^ ^ j = k-\-l 

-^(138)5 E "i’K^eE”'--^!^ E w»)‘wr]“)}(i+„Hi)r 

^ ' Z=fc+1 2=1 ^ ^ j = k-\-l ' ^ 

Thus, using notations given by (|2.10p and (|2.1ip . we obtain 


nR°*(S°)-'R° = n ^ < 

r,S,Z, 22,21 = 1 






0 0 


i{l-6) 


E O 0 On 
j=k+l 


1 

u V [ 0 0 


0 1 
nil — 


1 " 

(l_g]3 E wluwlywli]{l + oip{l)) 
'' ' 3 = k+l 


P 

E rsl uvl r s u Vf-t . /i\\ 

a. a ij (jj u Lo (l + ojp(l)j. 

r,s,l^u,v = \ 


In conclusion, for £3 of (ICTIll and for nR°*(S° )~^R° . using assumptions (A 6 ), (A7) and (A 8 ), together with 
the proof of Proposition 1 of iGuo et all ( 2013r) . we obtain: £3 = ojp(p^/^) and nR°*(S°)“^R° = Ojp(p^^^). 
Combining the last two relations together relation (ESI), we obtain that 

EL„fc(/3°) = (S°)-+ oAp"'^)- 


Proof of Proposition [4l We first prove 

= Oip(p^/^). (5.13) 

For this, we introduce the following two p-square matrices 

B° =(V°)-l/^S°(V°)-l/^ K°=Ip-B° 

and the following p-vector 

rl = (V°)-i/V^ 

With this notations, the left hand side of relation can be written 

- (S°)-i)^° = nr 7 °*(lp - (V°)i/^(S°)-i(V°= nr,°*(K°)-\° . 

We consider the following decomposition for n 77 °*(K° 

{K„) 77„ = [nrj^ K„r7„-n77„ (K„) tj„-(-1) nrj^ (K„) r7„j+(-l) nrj^ {K„) (Ip-(B„) )tj„, 

(5.14) 

for any 6 gN*. 

We will study the convergence of the expansion given by (15.141) . By Lemma 6 of I Chen et al.l ( 2009(1 . we have 
the inequality 'q^Arj^ < || 77 ° |p(tr(A^))^/^, for any symmetric matrix A. Then, for the first term of the 
right-hand side of relation (15.1411 we have that, with probability one: 

-0n2\1/2 


Ot-ryO 0 ^ II 0 11 2 / I / TyO \ 2 \ 

n77„K„77„ < n \\r]J\ (tr(K„) ) 


(5.15) 
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Using assumption (Al) and Lemma 4 of iLiu et al.l ( 2013h . we obtain that, with probability one: 

1 n , 0 ||2 ^ 1 


|r7°f = < 


71 (VO) 

By LemmalU we have that Hi/j® || = and thus 




nil — 




\V°jf = Oip{pn ^). 


(5.16) 


By Lemma [H for 1 < r < p, we have 


7,(K°) < (tr(K°)^)V2 = Ojp{pn-^/^). 


(5.17) 


Using relations (15.1611 . (15.1711 and condition p = o(n^^^) obtained by assumption (A6), we have for (15.1511 
that 

nr7°*K°r7° = n Oip{n~^p) 0]p{pn~^^^) = Ojp(p^n“V2) = oip(pV2). (5.18) 

On the other hand, using relations (I5d^ and (ISTfll . we obtain 

|r7°*(K°)'’77°| < ||r7°f max l7r(K°)'’| < ||r7° f (tr(K°)^)V2 = Ojp(pn“^)Ojp(p'’n“V2)^ 

1 <r <p 

which gives 

r,°‘(K°)'’r,° = Ojp{p^+\-^^+^'^/^). (5.19) 

The last equation means that the series 1)*’“^?7°*(K°)*’?7° is convergent for fixed n when p = 

o{n^^^). Then, taking also into account relation (15.1811 . we can conclude that 


E / l\t*—1 Ot /-ryO \b 0 / l/2\ 

(-1) Vn (K„) = ojp(p ). 


(5.20) 


The remaining task is to prove that the term nrj^(K^)^(Ip — (5°) ^)t 70 in (15.1411 is negligible as 5 ^ oo. 
For the last term of (I5ia . we have that 


\nrjl\K°S{Ip - (B0)-^)r70 I < |nr;0*(K0 )''r70 1 + |n(r,o )*(K0 )'’(B0 )-^r;' 


>0 \-l\_0 


0 t/T^0\b 0| 


0 \t/T^0\b/'t^0\-l 0 I 


(5.21) 


For the first term of the right hand side of (15.2111 . by relation (I5.19L we have, with probability one, that 
<(K0)^r,° = Ojp(p^+in(-^-2)/2). Then 


Ot /TyO \ b 0 — 6/2\ 

nVn (K„) = Oip{p ^ n ' 

For the second term of the right hand side of (imil . we have, with probability one, that 

rf(K°)'’(B°)-i77° I < n||r7° ||^Mk((K°)'’(B°)-1). 


(5.22) 


(5.23) 


Furthermore, according to Lemma 4 of iLiu et al.l (l2013h . for any p X p symmetric matrix A = (uij ), we have 
Mx;(A) < maxi<i<p |7i(A)|. Then, with probability one. 


Mc((K)()“(B)i)-") <p-Mc(K)()''-Mc(B(()-" <p- max l7r(K)()“| • max |7r(B)() 

l<r<p l^T'^p 


0 7-1 1 


By LemmalHl we know that |7r.(K°)| < (tr(K° )^)^/^ = Ojp(pn“^/^) and then, |7r(K°)^| < (tr(K°)^)^/^ = 
Ojp(p*’n“^/^). On the other hand, by Proposition|2]it is clear that 71 (B°)“^ = 7i((S°)“^V°) < 7i(V°)/Co, 
with probability tending to one. All these imply Mx((K°)^(B°)“^) < pO]p{p^n~^^^). Then, we obtain 

^Fc((K°)‘’(B°)-l) = Oj,(p'’+V-V2). ^5 24) 

Combining relations (15.1611 . (15.2311 and (15.2411 . for the second term of the right hand side of (15.2111 . we obtain 
that 

|nr,°‘(K°)'(B°)-ir,°| < np||r,° f Mx((K°)'(B°)-i) = Op(p'+^n-V2). 


(5.25) 
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By relations (15.211) . (I5.22L (15.2511 and assumption (A6), it follows that 

-h = ojp(l). (5.26) 

Combining the results obtained in relations (15.201) and (15.2611 . we obtain (15.1311 . 

The Proposition follows combining relation (15.1311 and Proposition [3l ■ 


Proof of Theorem [TJ Since 6 = k/n —i 0° G (0,1) and the point k, where the test is realised, is known, 
we suppose that 6 is 0°, then it is fixed. 

(i) We p rove relation (13.4L by co nstructing a martingale and applying the martingale central limit 
theorem Isee lChow and Teichei ( 1997lP . We will prove this relation in four steps. In Step 1 we construct a 
martingale, in Steps 2 and 3 we propose two sufficient conditions for applying a central limit theorem and 
hnally, in Step 4 we prove relation drill . 

Step 1. In this step, we will construct a martingale. 

For i = 1, • • ■ , n, let us dehne the following random vector sequence: 

if j < fc. 


G° = 


1=1 
k 




1=1 i=fc-i-i 

and also the random variable = ||G°||^ — ip. 

Then, the left hand side of drill can be written 

-P 


The relation between G° and G°_i is: 


G° = 


with Gn = 0. 


Anjn 



G^i + 

w? 

6 ’ 

if i < k, 

G° 1 

w° 

if i > k, 

^2 — 1 

1 -0’ 

cr(w?,-- 

■ ,w°) 

= cr(Gi, 


generated by Wi,-- - ,w° or by Gi,--- ,G°. Firstly, we study if is a martingale. For this, 

- ip / Hi_i. 


consider for example i such that i > k. Then 

= ||G°_i||" + 




{i-ey 


Consequently, {H^ is not a martingale. We will now construct a martingale based on G° with respect 

to the filtration {-Fi}i>i. For this, we define the following random variable sequence Ui = — H^_i, for 

i = 1, ■ ■ • , n, with Hq = 0. Then 




U°={ 


0||2 


6*2 


2 + 


, 1-6 * ' ( 1 - 0)2 
We consider the following two random variable sequences: 

r° = U°- 1E[U°], 


if i < k. 


— p, if i > k. 
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and 


1=1 1=1 

For all i such that i < k we have that the condition expectation of tpi given the cr-field is: 

= iiG°_if + ElMfl -^p-j 2 mu°] = iiG°_if mu°] = 


1=1 


1=1 


and for alH > fc: 


]E[p°\J^,-i] = ||G°_i||" + 


^[||w°|p] 

( 1 - 0)2 

Thus, {‘fi}i>i is a martingale with respect to 


-ip-Yl ■®'[^1 ] = ‘^i-i- 
1=1 


To apply the martingale central limit theorem of IChow and Teiched ( 1997ll for , J^i, i > 1}, it suffices 
to show that 


and 


= o{AI) 


J2lE[\E[{T°f\T,-i] - a^l] = o{Al), 


(5.27) 

(5.28) 


Step 2. In this step, we will prove relation (15.2711 . 

In order to facilitate writing, for i = 1, • • • , n, we denote 


0 _ w° w° 

Nj = -^^i<k - ^ _ g ]Iz>fc- 

Then, the random variable can be written 

T° = 2G°iiN° + ||N°f - IE[||N°f ]. 


(5.29) 


(5.30) 


By assumption (A9), for some positive absolute constant C 7 < 00, for alH = 1, ■ ■ ■ , n and all ji, ... ,ji = 
1 ,... ,p, Z G N, whenever Yl\=i have 

<G 7 , (5.31) 

with is the ji-th components of the vector N° defined in (15.2911 . By the Holder inequality, for any 

b < 3 we have 

l?[||N°f‘'] < ClE[{^wf^y] < Cp'’-^^]E[wf^] < Cp\ 

1=1 1=1 

By the Cauchy Schwartz’s inequality, we have that 

lE[NTGtif < 

On the other hand, by Lemma 7 of iGuo et all ( 2013ll . we have 

IE;[N°*G°_i]® < Gp®(i^ + i^ + i). (5.34) 

Then, by (15.3311 and (15.3411 . we obtain that ^?[N°*Gi_i]^ < [Cp^{i^ + «^ + This implies that 

IE;[N°‘G-_i]^ < + i + (5.35) 

On the other hand, we have 


(5.32) 


(5.33) 


IF[||N°||" - IF[||N°||2]]^ = IE I ||N°f - 3||N°||^IF[||N°||"] + 3||N°f (IF[||N°f ])" - (IF[||N°f ])31 . 
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Using (I5.32L we obtain that 

|li;[||N°f - JE[||N°f ]]^| < Cp\ (5.36) 

Using relations (15.3511 and (15.3611 . we can write ^t'[|rfp] < Cp^(l -(- t Then < 

Cnp^{\ + + n + which gives 


< Cp^{n^^^ + ri^ + n). (5.37) 

i=l 


On the other hand, using assumption (Al), by similar arguments as 
we have that > Cn?p -|- Op{p'^n). Then 


for relation (16) of iGuo et all ( 2013ll . 


Al > GnV/" + Op(p"n3/"). 


(5.38) 


From relations (|5.37p and (|5.38p . we obtain 


-0|3i 




,3„5/2 


A3 


n3p3/2 


C»p((p^/n)^/^). 


Since p = o{n}^^), relation (lOTIl follows. 

Step 3. Now, in this step we prove relation (15.2811 . 

By elementary calculations and using relation (15.3211 . we obtain (rf)^ = N°*Gi_i-|-4G°li N°(||N°|P — 

^[||N°f ]) + Ojp(p2). We observe also that erf = E[{T°f]. Then 

fU[(r°)"|J-._i] =4G°iiJE[N°Nf‘]G°_i+4G°ii.E[N°(||N°||"-fE[||N°||"])] +Off.(p") (5.39) 

and 

al = 4JE[G°iilE[N°N°*]G°_i] + JE[(||N°f - JE[||N°ll"])"] + 0(p"). 

By inequality (15.3211 we have JE[(||Nf |p — JEi[||Nf ||^])^] = O(p^). Then 

af = 41E[G°iifE;[N°N°*]G°_i] + 0(p"). (5.40) 

Using relations (15.391) and (I5.40L we obtain that, for any i = 1, - ■ ■ ,n, 


E[\E[Tf\T,-i] - alf] < 16hE[G°iiJU[N°N°*]G°_i - JU[G°iiN°N°‘G°_i]]' 


Before analysing the terms Ai and A 2 , we note that 


1 


fE(N°N°‘) = (V°)-i/^[^]l,<fc + ^-^l,>fc]v^,)(V°) 
In order to facilitate writing, we consider the following matrix 


Ox-l/2 


M° = 


1 ^ 1 
H 


(1-0)2 


^>k I Vf,) 


+li;[N°‘(||N°||"-IE[||Nf||"])]JE[G°_iG°ii]lE;[N°(||N°||"-IE;[||N°||"])] +0(p^) 

= 16(A-hB + 0(p^)) (5.41) 

For the term A of (15.411) . we have the decomposition 

A = JE[G°iiJU[N°N°*]G°_i]" - (lE;[G°iiN°Nf‘G°_i])" = Ai - A2. 


(5.42) 


(5.43) 


Then, can be expressed as: 

IE[N°N°*] = (V°)“^/^M° (Vf)“^/^. 


(5.44) 
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Hence, the term Ai of (|5.42L can be written 

Taking into account that the random vectors w° are independent, with mean zero, for all z = 1,..., n, we 
can decompose Hi as 

Hi = Hii + Hi2 + Hi3, (5.45) 

with 

-if i — 1 < k: 






h^l = l 
i-l 




h^l = l 




^ = 1 


-if i — 1 > k: 




+ 


h^l=l 

1 

(T^ 






^12 = ^ E Je[w^(v°)-i/^m°(v°)-i/Vw^*(v°)-'/^m°(v°)-'/V] 


h^l = l 


+ 


{i-&r 


J2 




h=l ^ ' h' = k+l 


We study Hu, H 12 , H 13 . For this, we consider the case z — 1 > fc, the other is similar. 
For Hii, applying Cauchy-Schwarz inequality, we have 


^ = 1 
k 


h' = k+l 




^ = 1 


h' = k+l 


Then, since (£ 2 )i<i<n are independent, we have: 


^i3 = ^(Et3(EM(V°)“'M°(V°)-i))E^^^( ^ tr(V^,,){V°)-iM° (V°)-i))'. (5.46) 


h=l 


h' = k+l 
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Similarly, we have for A 12 : 




h^l = l 


+ (T3^^[ E 


h'^l'=k+l 


= ^1e\ w^(V°)-i/^M°(V°)-i/^1?[w?w?*](V°)-i/2m°(V°)-i/V^ 


h^l = l 




h':^l' = k^l 


< ^P*^7i(V°) \ sup (7i(V°i))7i(V°^))) -7^1^^°) 


h,ie{i, - ,fc} 

>2 4,,.0n-1 


+7r^p(*-i-^) (7i(V^F))7iK/.')))-7r(M?). 

U /i',i' 6 {fc+l,- - 7 - 1 } 

Taking into account assumption (Al), we obtain that 


Ai2 < Cp{i — 1)^. 


(5.47) 


For the term A 13 of (15.451) . we have 


1 __ 1 


h=l j,l,s,t=l 
i-1 


+ 


h' = k + l j,l,s,t=l 

Taking into account assumption (A9), we obtain that 


( 1 - 0 )^ 


Ai 3 < ^Cp"fc7f(V°)-Si^(M°) + - fc)7f(V°)-ShM°). 


Using also assumption (Al), we obtain get 


Ai3 < Cp^{i — 1). 


(5.48) 


For the term A 2 of (15.4211 . by similar calculations, we obtain that 
-if i — 1 < k: 

o — ^ 

1 


= ^(EthVw(V°)“'M°(V°)-i))^ 


if i — 1 > k: 

k 


^2 = ^(Etr(V^^)(V°)-iM°(V°)-i))E-^( ^ tr(V^;,,)(V°)-iM° (V°)-i))'. (5.49) 


(1-0) 

h=l ^ ^ /i'=fc+l 

For i — 1 > fc, by relations (15.461) and (15.491) . we obtain that 

All ~ A 2 < 0 . 


(5.50) 


Inequality (15.501) is also true for i — I < k. 

Then, since the term A of (15.421) can be written A = (An + A 12 + A 13 ) — A 2 , combining relations (I5.47L 
(15.481) and (15.501) , we obtain that 


A < Cp(i- 1)^ + C'p'‘(i- 1). 


(5.51) 
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For the term B of (I5.41L in the case i — l>k, taking into account the fact that ^(N°) = Op, lE(w^) = Op 
for all h = 1,... ,n and the fact w° is independent of w°/ for h ^ h', we have that 


h=l 


h' = k-\-l 


h=l 


h' = k-\-l 


= ||N°||2] ^ E[wU°^] fE[N°‘||N°||^] + ^ fE[N°||N°||^] 


h,l = l 


(1-0)2 


h' ,1' =k-\-l 




u,s=l l,r=l h=l 
P P 


+ 




( 1 - 0 ) 


u,s = ll,r=l h'=k-\-l 


where 






'j is the (u,s)-th element of the matrix E^=i V" Using 

y US ^ ^ 


Lemma 4 of iLiu et al.l ( 2013|l and relation (I5.31L we get 


u,s = l l,r=l h=l 

p p i — 1 


(1-0) 




u,s=l l,r=l h' = k-\-l 




C 


^p^(fe7i(V„U2) (^^(v^*^))) + --- 1 - fc)7i(V„U2) g^p 

he{i,- -,k} (-L - ;i6{fc+i,-.-7-1} 

Then, by assumption (Al), we obtain that 


B < Cp^{i - 1). 

Similarly, we can prove that inequality (15.521) is also true for i — \ <k. 

In conclusion, combining relations (I5.41L (15.5111 and (I5.52L we get that 

E[\E[{T°f\T^-l] - a?I"] < Cp{i - if + Cp^{i - 1) + 0{pf. 

Then, by Cauchy-Schwarz inequality, we have 


(5.52) 


Er. 


r.. E[|B|(r»77._.] - 71] ^ Er.. (E[|E[(r»)“7._.] - 7|=]) 


1/2 


A 2 


< 


^-»TJ 


Cn{v}^^p'^ + p^l'^n + 0{p^)) 


< C 


C n^p + 0]p{p‘^n) 

3/2 2 
n ' p 


rBp 


and hence the relation (15.2811 follows. 

Step 4- In this step, on the basis of the central limit theorem for martingales, we will complete the proof 
of relation dUl). On the basis of r elations ( 1 A 271 ) and ([ 5 j 2 ^ proved in Step and Step 3 , applying the 
martingale central limit Theorem of I Chow and Teiched (Il 997 ) (Theorem 1 , page 336 ), for J>i}n>i we 
get: 

u—^oo 


AA(0,1). 


(5.53) 
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Using notations given in Step 1, we have n ^'4’n ~ p) = ]E[Ui]. In the other 

hand, since Hq = 0, we have ^[Ui] = E[H^] = _2?[||G° ||^] — np. But, taking into account relations 

(1^ and (ESI), we get: 

i=l ' j = k-\-l 

2 fe 2 

= ^E(xUv°)-^E((v^)“'^'E + 7r^ E (x‘(v°)-i/2)((v°)-i/%) 

= ^Etr(^(XUV°)-^/^)((V°)-^/^X,)) E tr(^(X‘(V°)-i/^)((V°)-^/%) 


= nV((V°)-i/^(^^X.X‘ + ^^^ ^ X,X‘)(V°)-i 


-1/2N 


i=l 


{i-ey 


j=k+l 


= ntr((V°)-i/M(V°)-'/") 


= np. 


Thus lE[Hn] = 0. Then, taking into account relation (15.531) . we obtain claim (13.411 . 

(ii) The assertion results from (i) combined with Proposition [3] and Proposition U] 

Proof of Theorem By elementary calculations, we have under hypothesis Hi, with probability one: 

j2 X,X‘)(/3°-/3°). 

^ ^j=k+i / 

Also, under Hi, since IEi[z°] = Op, using assumption (Al), we have 

= E x,x‘)(/3°-/3‘>)(i + o^(i)). 


Then, 


x,x‘'^ 


)(1 + ojp(l)). 


mPi{f3°){V°^) >J/3°)= (ntA°‘(V°) + n\{f3° - 

> j=k+ 

The test statistic becomes 

An/n An/n 

Since [32 ^ (3°, using with Theorem[Tl together assumption (Al), we have that |2(/3°)| —>• oo, in probability, 
as n —>• oo. ■ 


■t\2 

3 I 


5.2 Lemmas 


In order to prove Propositions [T] [2] El 0] and Theorems [T] an d[2l we need the following lemmas. 

The hrst four lemmas establish equivalent results obtained bv IGuo et all ( 20131) for a linear model without 
change-point. 

Let us consider the following p x p matrix L„ = S° — V°. For 1 < u, u < p let us consider Ln-u,v the (u, v) 
element of the matrix Ln. Let also consider the largest absolute element of Ln : maxi<„ 


yZiu,v)\, with 


,v) a^nd 


denote the {u,v) components of S„ and V„, respectively. 
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Lemma 1 Under null hypothesis Hq, suppose that assumptions (A3) and (A4) are satisfied. For any e > 0, 
there exists a positive constant Cq that depending only on q > A, such that 


Proof. The matrix L„ can be written 






{i-ey 


j=k+l 


The (rt, v)-th element of the matrix L„, for 1 < u, r < p, is 

k 


Ln,{u,v) — '^^i,uXi,v{Si CT ) + ^ Xj,uXj^v{Sj O ) _ 

i=r ' ’ i=k+l 


4'k.)- (5-54) 
( 2 ) 


Since 9 = k/n —>■ 0° G (0,1) as n —> oo, we can apply Lemma 1 of iGuo e~ 3 (l2013h for and 

Then, for fixed 9 G (0,1), for all e > 0 there exists two positive constants Cq^'^ and Cq'^'^ such that: 


and 






P 




P 


nil'^et 


(5.55) 

(5.56) 


Then the Lemma follows from relations (I5.54L (15.5511 and (15.561) . considering Cq = max {Cg^\ ® 

By the next lemma we prove first that all eigenvalues of S° converge to those of V° uniformly with 
the rate Op{p maxi<„^.„<p — l^,?,(„,.u)|) and then that all eigenvalues of S° are bounded and strictly 

positive for n enough large. 

Lemma 2 Under assumptions (Al), (A3), (A4) and (A6), there exists two constants Co,Ci > 0 such that 
the inequality 7 p(S 5 () > Go and 71 ( 8 °) < Gi hold with probability tending to one as n 00 . 

Proof. Arguments based o n the properties of the trace and of the eigenvalues of symmetric square matrices, 
together with Lemma 2 of iGuo et 3 (|2013|l imply that 

max | 7 r(V°) - 7 r(S°)| < p max 15°,P^,(„,„)|. (5.57) 

l<r<p ^^u,v<p 

Lemma follows by combining this last relation with Lemma[T]and assumption (Al). ■ 

Let us consider the following random variable T° = max i^i^k ^ ||, (1 — ^)~^||zj|| L 

Lemma 3 Under the null hypothesis Ho, suppose that assumptions (A3) and (A 4 ) are satisfied. Then, for 
q > A, we have T° = op{p^l‘^n^l‘^). 


Proof. Since 9 = kjn —>■ 0° G (0,1) as n —>• 00 . we have with p robability 1, to 
T'n Gmaxi<i<ji ||z°||, with G > 0. By Lemma 3 of IGuo et all ( 201311 . we have that: 
op(jfil'^n}l and the lemma follows. 


00 , we have with p robability 1, for enough large n: 

maxi<i<„ ||z?|l = 


By the following Lemma we give an asymptotic approximation for the L 2 -norm of the vector 'i/Jjj(/ 3°), 
given by (1^ . under hypothesis Hq. 

Lemma 4 Under the null hypothesis Hq, if assumption (Al) holds, we have 11^/^° II = Op{p^^^n~^^^). 
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Proof. Let is the p x k design matrix whose k columns are Xi, for i = 1,... ,k and X*-^^ is the 
p X {n — k) design matrix whose (n — k) columns are Xj, for j = fc -|- 1,..., n. Since (si) are independent, 
we have that 


^ ^ '' j,y=k+i 




„0t0 1 
Zj Zjf\ 


a 

n 




X 


( 2 ) 




Using assumption (Al), we obtain that 

+ = 0 (pn-i). 

Then, ||t/,0|| = Ojp(pi/2n-i/2). 

The following Lemma gives a first approximation for the EL statistic, under hypothesis Hq. 
Lemma 5 Under the same assumptions as in Proposition [3 if hypothesis Hq is true, we have: 

EL„,{/3°) = 2nAV°-nA‘S°A+|(^^(z°‘A)3-^^^ (z^Af) + o^(l). 


j = k+l 


with 6 = kfn. 

Proof. The limited development of the statistic ELnfc(/3°) specihed by relation (12.4L in the neighbourhood 
of A = Op, up to order 3 can be written 


^ 2 = 1 J = fc+1 / ^ 2 = 1 ^ ^ j = k-\-l 




3\e3 

^ i=l 
= £\ — £2 S 3 £ 4 , 


j=k+l 


4! 


a^EL„fc(/3°)(A„^.,) 

dXudXvdXrdXs 


iXu){Xy)iXr){Xs) 

(5.58) 


where for all 1 < u,v,r,s < p, A« is the u-th component of A and Xuvrs = OuvrsX, with Ouvrs G [0,1]. 
We first study £ 4 , which can be written 


£4 = — 


1 /12 


E 


(z°*A)^ ^ 12 


4! (1 + 0-iAzO)4 (1 - 0)4 (1 - (1 - 0)-iAzO)4; ■ 


E 


[zfxf 


(5.59) 


By Proposition 1 of ICiuuerca and Sallourtil (|2015h . we have that, for all e > 0, there exists two positive 
absolute constants Mi and M 2 such that 


P 


k k 

EE(^?‘E<E 


(z?‘A)4 


Ml 04 


^ (1-k0-iAzO)4 AL204 ^ 




> 1 - e. 


Then, for the hrst term of the right-hand side of (15.5911 . applying Cauchy-Schwartz’s inequality, we obtain 
that 

(z°‘A)4_ _ _ 

M 2 04 


(1 + «-.AzO)‘ - S'”-'"’* - ^ ^ IIX'll'^tlAIlf 


E 


2 = 1 


Using assumptions (Al) (A2), together with the fact that ||A|| = 0]p{p^^‘^n ^Z^) given by Proposition [T] 
and p = o(n4/^), we obtain that the first term of (15.591) is ojp(l). In the same way we can demonstrate that 





























24 


Gabriela Ciuperca, Zahraa Salloum 


each term of (15.5911 is Oip(l), which implies that 84 , = ojp(l). 

Using notations given by (EH) and (El, we obtain that £\ = 2nA*'i/)° and £2 = nA*S° A. Then, relation 
(15.5811 becomes ELrifc{/3°) = — nA*S° A + £3 + Oip(l) and the lemma follows. ■ 


The following Lemma gives, under hypothesis Hq, an approximation for the Lagrange multiplier A and 
the asymptotic behaviour of sup]^<j<^ = Ojp(l). 

Lemma 6 Under the null hypothesis Hq, suppose that assumptions (Al), (A3)-(A6) are satisfied. Then for 
g > 4 and fixed 6 £ (0,1), we have supj^<j<„ = ojp(l) and 

A = (5.60) 

Proof. By Proposition [T1 we have that ||A|| = Oip{n~^^^p^^‘^). Note that, by Cauchy-Schwartz’s inequality 
and by Lemma E we have 

sup |A*z°| < ||A||r° = 

l<i<k 


and 

(1 - 6 )~^ sup |A*z°| < ||A||T° = OpirT^I'^p^l'^) 0 ]p{n^^‘^p^^'^) = 

k-\-l<j<n 

with 9 = k/n. These two relations together assumption (A5) involve supj^<j<„ | = Oip(l). 

We prove now relation (15.6011 . The limited development of (12.511 . in the neighbourhood of A = Op, up to 
order 3 can be written 


fc n 1 ” 

0p = + (i_0)2 


+ 


+ 


i=l 
1 


j=k+l 

fc n 

(l_g)3 E 


J=fc+1 


3=fc+l 


(0 + AzO)4S 


E O f—Ot \ \3 

Zi (zi A) 


(1 - 0 - A z°M 




(5.61) 


where A = mA, with u £ (0,1). _ 

Using Proposition 1 of lCiuperca and SalloumI ( 2015h . similarly as for the term £4 of Proposition E we can 
demonstrate easily that the last term of the right hand side of relation (15.6111 is ojp(l). We recall that 


R' 


0 _ 


- n93 12 


‘z° 


f- 


1 ( 1 -61)3 


E O /\t 0\2 


j=k-\-l 


(5.62) 


Then, using notations given by (lOl . E3 and (15.6211 we obtain that relation (15.6111 becomes — S5(A + 
Rn = Op- Thus 

A= (S°)-'(R°+i/>°)(l + ojp(l)). (5.63) 

We recall that T° = max i^i^fc |0~^||z°||, (1 — 0)~^||z°|||. Then, we have for R°: 


R„ 


< T°A‘ 


V n6»2 


E 


r,0„0t 

z,- z,- • 




E 

j = k+l 


Z o Z 


A < T°A*S° A. 


Using Lemma 4 of iLiu et al.l (l2013h and LemmaEwe obtain that ||Rn|| < 7’°||A||^7i(S°) = Oip(T°||A|p). 
On the other hand, by Proposition E we have that ||A|| = 0]p{n~^^^p^^^) and by Lemma Efha-t T° = 


0 ]p{n 


i/9pi/2^^ Then for ||R^||, we obtain that 


||R°||=o^(n(^-‘')V/")- 


(5.64) 
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Relation (15.601) follows from (15.631) and (15.641) . ■ 

The following result gives an asymptotic approximation for [(S°)“^ — and [(S°)“^ — 

Lemma 7 Under the null hypothesis (Hq), if assumptions (A3), (A4) and (A 6 ) hold, we have 

(z) ((S^)-' - (V°r'X = ((V0)-V°)oj,(l). 

(zzj ((S°)-i - (V°)-i)R° = ((V°)-iR°)oip(l). 

Proof, (i) By Lemma [1] we have that, under assumptions (A3) and (A4), for all e > 0 there exist Cq > 0, 
for (? > 4, such that 

2-1-9 


Furthermore, under assumption (A 6 ) we have 


inax (u,"!;)I 

1<U,V<P 


(5.65) 


We recall that, for a matrix A , ||A||i is the subordinate norm to the vector norm ||.||i. Using Lemma l{iii) of 
ICiuperca and Sallouml ( 201 ^ . Lemma[ 2 l relations (15.571) . (15.651) and the identity V°((S°)“^ — (V°)“^)'i/j° = 
(V°(S°)“^ - Ip)^/j°, we have 

||v°((s°)-^ - (v°)-iX|| = ||(V° - s°)(s°)-V°II < ||v° - s°||J(s°)-i 

| 7 ,(V° - S°)| • | 7 i(S°)-'| ■ I < Cp max 
||oip(l). 


< max 

l^r^p 

II ./0 


which implies that ((S°) ^ - (V°) = (V°) V°ojp(1). 

(ii) The proof of (ii) is similar to (i). 


The following lemma is needed for proving Proposition^ We recall that K° = Ip —(V°) (V°) 


Lemma 8 Under null hypothesis Hq, if assumption (A7) holds, then 

tr{Kn)‘^ = 0]p{p^n~^). 

Proof. We show that .E[tr(K°)^] = 0{p^n~^). For this, we write the matrix K° as 


T^O T ^ 0 ot t 

7=1 ^ ' 3=k-\-l 


0 Ot 


Then 


Et.=i Etl <r<s + (T^ e;=.+i <,rwlsf] 

-2n-i EEi Eti JE[<rwlr] + (1 - 0)-^ EE^+i +P- 


(5.66) 


For the first term of the right-hand side of (15.661) . using the independence of w° for all i = 1, • • • , n, we 
have that 


„2 Y 


z —i ■. ^ 02 

r,s=l 7=1 


^ Y (-§4 Y lE[wlrwlr-wlswls] + Y 

T’,S = 1 7=1 ^ 7 = fc+l 


0 0 \ 2 -| 
2 '^j,rWj,s) \ 


0 0 .0 0 n 

Wj j^Wj 'pWj gWj g \ j 


1 _ _ 1 1 

+ 7(2 Y Y ^{w°,rwls])^ 


z— i ^ 02 
r,s=l 7=1 


(1-0)= 


j=k+l 
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1 P P 

^ \ '' rrss I \ / rs\2 /->/ 2 —1\ 

= “ “ + 2^ (a ) = 0(p n ). 

T’,s=l r,s = l 

The last equation is due to assumption (A7) and to the fact that a'’® = 0, for r ^ s. 

For the second term of the right-hand side of (15.661) . by the fact that a’’'’ = 1, we have that 


2 

n 



{i-ey 


E 


0 0 
Wn rWn 7 


i=fc+i 


= E' 


=p. 


Then, using relations (I5.66L (15.6711 and (15.681) . we obtain that 

l?[tr(K^)] = O(p^n-i). 


(5.67) 


(5.68) 


By Markov’s inequality, Lemma yields. 
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